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Abstract 

The middle levels problem is to find a Hamilton cycle in the middle levels, -M^+i, 
of the Hasse diagram of £>2fc+i (the partially ordered set of subsets of a 2k + 1- 
element set ordered by inclusion). Previously, the best known, from [1], was that 
M2k+i is Hamiltonian for all positive k through k = 15. In this note we announce 
that M33 and M35 have Hamilton cycles. The result was achieved by an algorithmic 
improvement that made it possible to find a Hamilton path in a reduced graph (of 
complementary necklace pairs) having 129,644,790 vertices, using a 64-bit personal 
computer. 
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1 Introduction 



Let B n be the n-atom Boolean lattice, i.e., the partially ordered set of subsets 
of [n] = {1, 2, . . . , n}, ordered by inclusion. The Hasse diagram of B n is iso- 
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Fig. 1. The Hasse diagram of B§ showing a Hamilton cycle in M5. 

morphic to the n-cube, whose vertices are the n-bit binary numbers, with two 
numbers adjacent if they differ in one bit position (Figure f). The ith level of 
B n can thus be viewed as the set of n-bit binary numbers with i ones. 

The middle levels problem is to determine if there is a Hamilton cycle in 
the subgraph M 2 k+i of B 2 k+i induced by the middle levels k and k + 1. The 
heavier lines in Figure 1 show a Hamilton cycle in M 5 . The graph M 2 k+i gained 
notoriety as an example of a connected, vertex transitive graph, all of which 
were conjectured by Lovasz [2] to have Hamilton paths. 

The middle levels problem remains open, in spite of the efforts of many 
[3,4,5,6,7,8]. In 1990, in unpublished work, Moews and Reid verified that M 2k+ i 
is Hamiltonian for 1 < k < 11. In 1999, we extended this for 12 < k < 15 
[1]. In this note we announce that M33 and M35 are Hamiltonian. The new 
results are due to an algorithmic improvement that made it possible to find 
a Hamilton path in a reduced graph of complementary necklace pairs having 
129,644,790 vertices, using a 64-bit personal computer. 

It is known that any connected vertex transitive graph G with V(G) vertices 



has a cycle of length at least y3V(G) [9]. For M 2 k+i, the best absolute lower 
bound is given by the following theorem in [10]. 

Theorem 1 The middle levels graph, M 2k+ \, has a cycle of length at least 



if for every i < j, M 2i+ i has a Hamilton cycle. 

Since our new results show M 2k+ i Hamiltonian for 1 < k < 17, it follows that 
M 2k+ i has a cycle of length at least 0.867^(M 2 fc+i). 
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Recently, it was shown that M2k+i is "asymptotically Hamiltonian" in the 
following sense [11]: There is a constant c such that for all k, M 2 k+i has a 
cycle of length at least (l — c/\fk)V(M 2 k+i)- In the other direction, it has been 
shown that M 2k+1 has a closed spanning walk in which no edge and no vertex 
occurs more than twice [12]. 

In Section 2 we describe a reduction of the problem which lessens the memory 
and computation requirements. In Section 3 we review the Hamilton cycle 
heuristic (SS) from [1]. In Section 4 we describe the improved SSS heuristic 
which made the present results possible. 



2 Reducing the problem 

We sketch here the reduction of the middle levels problem that was was used 
in the earlier work of Dejter [3] and of Moews and Reid. 

Given a string x = X\X 2 ■ ■ ■ x n of n symbols, define the cyclic shift o by 
a{x\x 2 . . . x n ) = X2X3 . . . x n x\. Let a l (x) = a(x) and for % > 0, a l+1 (x) = 
a(a l (x)). Define a relation ~ on the set of n-bit binary numbers (regarded as 
n-bit strings) by x ~ y if and only if y = a l (x) for some integer i. The relation 
~ is an equivalence relation and the equivalence classes are called necklaces. 
Denote the necklace of x by v(x). 

Fix n = 2k + 1. We use necklaces to define a quotient graph of the middle 
levels graph M n . Let N n be the graph whose vertices are the necklaces of the 
vertices of M n (i.e., necklaces of 2k + 1-bit strings with k or k + 1 ones). The 
edges of N n are those pairs v{x)v{y) such that xz G E(M n ) for some z G v{y)- 
Note that the necklace of a 2k + 1-bit binary number with k or k + 1 ones has 
exactly 2k + 1 elements, so N n is smaller than M n by a factor of n. 

The complement b of a binary digit b is 1 if b = and if b = 1. Extend this 
to binary strings by bitwise complement and to necklaces by by v(x) = u(x). 
Note that this is well-defined since y = a l (x) if and only if y = a l (x). 

We use these complementary necklace pairs to further reduce the problem size 
by a factor of 2 by observing that v{x)v{y) G E(N n ) if and only if u(x) u(y) G 
E(N n ). Now define an equivalence relation ~ on the necklaces in V(N n ) by 
X ~ Y if either X = Y or X = Y and denote the equivalence class of ~ 
containing X by p(X). Since every string in X has odd length, X ^ X, and 
every equivalence class p(X) has exactly 2 elements. Construct the reduced 
graph, R n , whose vertices are the equivalence classes {p{X) | X G V(N n )} 
with edges p(X)p(Y) e_E( R n ) if XY G E(N n ) or XY G E(N n ). Observe that 
if Z = u(0 k l k+1 ) then Z = u(0 k l k+1 ) = u(l k k+1 ), and so ZZ G E(N n ). Hence 
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(c) Lifting cycles in N5 to cycle in M5. 
Fig. 2. Lifting a path in i?5 to a cycle in N$ and then a cycle in M5. 



p(Z)p(Z) G E(R n ) and R n has loops, so it is not a simple graph. 

We exploit the fact that R n has loops to show that a Hamilton path from 
the distinguished vertex r\ = p(u(0 k l k+1 )) to the distinguished vertex r\ = 
p(z/(0(01) fe )) in R n can be used to construct a cycle in N n which can be lifted 
to a cycle in M n . The path in R n gives rise naturally to a pair of paths in 
N n where corresponding pairs of vertices from each path are complements 
of each other. Because the distinguished vertices in R n each have incident 
loops, these paths link to form a cycle in N n . Finally, since k and 2k + 1 are 
relatively prime, we can use suitably chosen necklace representatives to extend 
the cycle in N n to a cycle in M n . The case for M 5 is illustrated in Figure 2, 
where for x G V(M 5 ), [x] denotes the necklace v{x) G V(N 5 ) and for necklace 
X G V(N 5 ), {X,X} denotes p(X) in V(R 5 ). 
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s x y z v w u 

Fig. 3. Backtrack search from s has reached u and cannot continue. 
3 The Hamilton cycle heuristic 

Given a graph G and vertices s,t G V{G) we would like to find, if possible, a 
Hamilton path starting at s and ending at t. 

A standard backtrack search attempts to construct such a path by starting at 
s and extending the path to a new vertex as long as possible. Whenever it is 
impossible to further extend the path from a vertex x, the search "backs up" 
to the predecessor, y, of x on the path and the path is extended (if possible) 
from y to one of its other neighbors. Figure 3 illustrates a path P from a 
starting vertex s to a vertex u whose only neighbors in G and v, 

which are already on P. In this case, backtrack search would back up to w, 
then v and eventually back to z, at which time t would be added to the path. 

Posa [13] observed that reaching a dead end in backtrack search could be 
used as an opportunity to modify the current path by a rotation and thus 
possibly to continue. If P = (-ui,-u 2 , . . . ,Uk) is a path in G and there is an 
edge UkUj for some < j < k, then the rotation of p at Uj is the path 
P' = (ui, u 2 , ■ ■ ■ Uj, Uk, Wfc-i, • • • , Uj+i) obtained by deleting edge UjUj + i from 
P and inserting UjU^. Path P' has the same length as P, but a different 
endpoint. Figure 4(a) shows the rotation at x of the path in Figure 3. 

Define PosaSearch to be the Hamilton path heuristic which constructs a path 
P by starting at a vertex s and iterating the following: extend P, avoiding 
t until the end, until no longer possible; When P cannot be extended from 
an endpoint u, select a neighbor v of u and perform a rotation of P at v. 
PosaSearch may not succeed in finding a Hamilton path even if one exists. 
Furthermore, it may run indefinitely. 

For the path P in Figure 3, PosaSearch can only transform P into one of the 
paths shown in Figure 4. None of these transformations will ever allow the 
vertex, t, to be added to the path. 

Rotations transform a path and alter the order of vertices on it. For a path 
P, let P(i) be the ith vertex of P, let pos(P, v) be the position of vertex v on 
P, and let \P\ be the number of vertices on P. If P is a path with endpoint u 
and if P(j) is adjacent to u, we can rotate P at P(j) to arrive at a new path 



(a) Path ending at y after rota- 
tion at x. 



(b) Path ending at x after rota- 
tion at s. 




(c) Path ending at w after rota- 
tion at v. 



(d) Path ending at w after rota- 
tion at s followed by rotation at 
u. 



Fig. 4. Possible endpoints after one or more rotations 



P' . The position of a vertex v on P' is then given by 



pos(P', v) 



pos(P, v) if pos(P, v) < j 

\P\ — pos(P, v) + j + 1 otherwise. 



Similarly, the vertex now in position i on P' is 



P'd) 



P(i) if i < j 

P(\P\ — i + j + 1) otherwise. 



(2) 



The SS heuristic from [1] uses a variation of PosaSearch that looks ahead 
before performing any rotation. SS extends the path as is done in PosaSearch 
until all neighbors of the endpoint are already on the path. If the path is 
not already a Hamilton path, it then uses breadth- first search (BFS), and 
repeated application of eqs. (1) and (2) to search for a sequence of one or 
more rotations guaranteed to result in a path which can be further extended. 
If such a sequence is found, the sequence of rotations is performed and the path 
is extended. If no such sequence exists, the SS heuristic terminates without 
having found a Hamilton path. Figure 4 shows four paths obtainable by one 
or more rotations from the path shown in Figure 3, In this example, there are 
only four possible endpoints, u, v, w, and x. The path cannot be extended 
from any of these, so SS would terminate after evaluating all four. 
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4 The new SSS heuristic 



The original work on the SS heuristic [1] used a 400MHz Intel Pentium-II 
system with 192MB RAM. Results were later run on a 2.4GHz Intel Pentium 4 
system with 512MB of RAM (See [14]). We recently acquired an AMD Athlon 
3500+ system with 2GB of RAM and converted the program to use 64-bit 
values for most internal computations. With this system and the SS heuristic 
we were able to find a Hamilton path in R 33 in about 3.5 weeks. The program 
ran entirely in memory, using approximately 1GB of RAM. However, it was 
unlikely that i? 35 would be feasible. 

Therefore, using performance profiling tools, we analyzed the code perfor- 
mance and discovered that, contrary to expectations, a large part of the time 
was spent on performing the rotation operations, rather than on the BFS 
to find promising sequences of rotations. As a result, we made the following 
changes to the heuristic that resulted in dramatic speed improvements. 

First, when BFS finds a sequence of rotations that will enable an extension 
of the path, instead of actually performing each rotation of the path the SSS 
heuristic encodes the sequence of rotations as a list of ordered pairs, each 
representing the number of vertices on the path at time of rotation and the 
rotation point. This list, together with repeated application of eqs. (1) and 
(2), suffices to calculate the actual vertex in a given position or the current 
position of a given vertex, when the sequence of rotations represented by the 
list of ordered pairs has not been explicitly performed. 

Secondly, the SSS heuristic periodically (but rarely) goes ahead and performs 
all the rotations encoded by the list of ordered pairs. Given a path v , . . . , vi, 
if we perform a rotation at Vi to make v i+ i the new endpoint, then there is 
a section of i + 1 vertices up to i>j that will not be moved and a section of 
I — i vertices after Vi that will be reversed. We may describe each of these 
sections (or blocks) of the path by a triple, {s,n, d}, where s is the index of 
the first vertex in the block, n is the number of vertices in the block and d is 
a direction indicating whether the block is now in original (forward) order or 
reversed. Our single rotation at V; L is represented by the triples {0,i + 1, — >} 
and {i + 1, 1 — i, <— }. As we process a rotation from the list of ordered pairs 
we first add a block in the forward direction, representing any vertices added 
by extension since the last rotation. Next we determine the block containing 
the new endpoint and (possibly) split it into two blocks. All blocks before the 
new endpoint remain unchanged, while the sequence of blocks after the new 
endpoint, as well as the direction indicator of each such block, is reversed. 
After the list of blocks has been created representing all the stored rotations, 
the path can be copied block by block to effect the series of rotations in a 
single copy operation. 
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These modifications allow a sequence of rotations to be accumulated, without 
being performed, at the relatively smaller cost of increasing time for calculation 
of vertex position or vertex in a given position. The accumulated sequence of 
rotations is performed on a schedule discussed in the next subsection. This 
significantly reduced the average work per rotation. 

Figure 5 illustrates the process for a list of three rotations, as seen in one run 
of the program. The subscripts in each block indicate the order of creation 
of the block. For computational convenience, a rotation point, v, is actually 
represented in the list of rotations by its position on the path. 



4-1 Graph representation and storage 

In the middle levels of £> 35 , the reduced graph, i? 35 , has over 129 million ver- 
tices. An array of 32-bit integers with one entry per vertex takes approximately 
half a gigabyte of memory. Our system had only 2GB of RAM, so storage for 
basic information on vertices and paths was limited. In this environment, it is 
infeasible to store adjacency lists, so we recalculate adjacency lists as needed 
rather than storing them (see [14]). We reduced all arrays to the smallest na- 
tive size (char, short, int, or long) that was sufficient for the data and still 
experienced unacceptable levels of memory swapping. 

The vertices of -R35 are represented by the lexicographically least elements (as 
sets) of each 35-bit necklace with 17 ones. Thus the two higher order bits 
will always be and we need only use 33 bits for the internal representation. 
However, we store the lower 32 bits in 32-bit integer arrays and add the high- 
order bit according to the position of a vertex in the array. This saves half a 
gigabyte of RAM as compared to using 64-bit long integers. We still use 64-bit 
long integers for internal calculations, but none of our major storage arrays 
needs larger than 32-bit entries. 

When copying the path to execute a series of rotations, we copy it to the 
position array (Pos) and then rebuild the position array after the copy as 
this results in less swapping than using memory that has not recently been 
referenced, such as the parent array used in building a BFS tree. 

Fine tuning in this way resulted in the program using approximately 85% 
to 90% of the CPU for most of the run while finding a path in Some- 
what higher CPU usage was observed toward the end of the run, even though 
BFS trees, with their storage requirement, were being computed much more 
frequently. 
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Saved list of three rotations: {35,15}, {36,9}, {40,32} 
1. {35, 15} - 35 vertices on path, rotation at v±5 
Create initial block 

0,35,- 

Split block 0, into blocks and 1 



0,16,- 


16,19,-i 


Rotate block 1 


0,16,- 


16,19,-i 



2. {36, 9} - 36 vertices on path, rotation at vg 



Add block 2 



0,16,-o 


16,19,-i 


35,1,-2 




Split block 0, into blocks and 3 




0,10,-o 


10,6,-s 


16,19,-! 


35,1,-2 


Rotate blocks 3 through 2 


0,10,-o 


35,l,- 2 


16,19,-! 


10,6,-3 



3. {40, 32} - 40 vertices on path, rotation at V32 



Add block 4 



0,10,-o 


35,1,-2 


16,19,-i 


10,6,-3 


36,4,-4 



Split block 3, into blocks 3 and 5 



0,10,-o 


35,1,-2 


16,19,-i 


13,3,-3 


10,3,-5 


36,4,-4 


Rotate blocks 5 and 4 


0,10,-o 


35,1,-2 


16,19,-i 


13,3,-3 


36,4,-4 


10,3,-s 



Final path: 

V ,. . . ,Vg,V 35 ,V W ,...,Vu,Vl5, VIA , V13 ,V 3 g , V 38 , V 37 , V 36 ,Vl ,V U , V\i 

V „ 'S^^ „ „ v ' S v ' 

2 1 3 4 5 

Fig. 5. Performing a sequence of rotations from a list. 

4-2 Performance of the heuristic on the middle levels graph 



The SSS heuristic was applied to search the reduced middle levels graphs 
-R2/C+1, k — 1, 2, ... 17, to try to find a Hamilton path from vertex p(u(0 k+1 l k )) 
to vertex p(z/(0(01) fc )), and therefore a Hamilton cycle in M 2 k+i- A Hamilton 
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path meeting these requirements was found for each k < 17. 

The results are summarized in Table 1. We measured elapsed time using a 
timer with a resolution of 1 second. Runs that start and complete in the 
same second show a time of seconds. Table 1 also shows earlier results 
obtained with the SS heuristic on a 2.4GHz Intel Pentium 4 system with 
512MB of RAM. Many factors affect the difference in performance between 
the two systems, including operating system and compiler differences as well 
as processor, RAM, and disk speed. The newer system ran the SS heuristic 
approximately twice as fast as the older system. Thus for k = 15, the largest 
value for which we have both results, we see that hardware doubled the speed 
and the switch to the SSS algorithm achieved a further 16-fold increase. 



Table 1 

Running time to find a Hamilton cycle in the middle levels graph 



k 


n = 2k + 1 


# vertices in R n 


# vertices in M n 


Time (Sees) 










SS 


SSS 










(32-bit) 


(64-bit) 


8 


17 


1,430 


48,620 








9 


19 


4,862 


184,756 








10 


21 


16,796 


705,432 


1 





11 


23 


58,786 


2704,156 


5 


2 


12 


25 


208,012 


10,400,600 


105 


10 


13 


27 


742,900 


40,116,600 


1,732 


99 


14 


29 


2,674,440 


155,117,520 


24,138 


799 


15 


31 


9,694,845 


601,080,390 


307,976 
(3.6 days) 


9,446 


16 


33 


35,357,670 


2,333,606,220 




106,118 
(1.2 days) 


17 


35 


129,644,790 


9,075,135,300 




1,765,497 
(20.4 days) 



As noted earlier, the deferral of rotations also involves a cost. We experimented 
with the number of deferred rotations using values ranging from \ogV(G) 
to \JV{G). Our results were obtained using a value of \JV{G), or 11,387 
for the reduced graph, i? 35 . There is some evidence to suggest that a small 
performance improvement could be obtained by varying this number during 
the running of the program, particularly in very large graphs. 
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